FILTER: An Enhanced Fusion Method for Cross-lingual Language Understanding
نویسندگان
چکیده
Large-scale cross-lingual language models (LM), such as mBERT, Unicoder and XLM, have achieved great success in representation learning. However, when applied to zero-shot transfer tasks, most existing methods use only single-language input for LM finetuning, without leveraging the intrinsic alignment between different languages that proves essential multilingual tasks. In this paper, we propose FILTER, an enhanced fusion method takes data XLM finetuning. Specifically, FILTER first encodes text source its translation target independently shallow layers, then performs cross-language extract knowledge intermediate finally further language-specific encoding. During inference, model makes predictions based on language. For simple tasks classification, translated shares same label shared becomes less accurate or even unavailable more complex question answering, NER POS tagging. To tackle issue, additional KL-divergence self-teaching loss training, auto-generated soft pseudo-labels Extensive experiments demonstrate achieves new state of art two challenging multi-task benchmarks, XTREME XGLUE.
منابع مشابه
An Enhanced Median Filter for Removing Noise from MR Images
In this paper, a novel decision based median (DBM) filter for enhancing MR images has been proposed. The method is based on eliminating impulse noise from MR images. A median-based method to remove impulse noise from digital MR images has been developed. Each pixel is leveled from black to white like gray-level. The method is adjusted in order to decide whether the median operation can be appli...
متن کاملan enhanced median filter for removing noise from mr images
in this paper, a novel decision based median (dbm) filter for enhancing mr images has been proposed. the method is based on eliminating impulse noise from mr images. a median-based method to remove impulse noise from digital mr images has been developed. each pixel is leveled from black to white like gray-level. the method is adjusted in order to decide whether the median operation can be appli...
متن کاملLanguage model adaptation using cross-lingual information
The success of statistical language modeling techniques is crucially dependent on the availability of a large amount training text. For a language in which such large text collections are not available, methods have recently been proposed to take advantage of a resource-rich language, together with cross-lingual information retrieval and machine translation, to sharpen language models for the r...
متن کاملNatural Language Understanding for Information Fusion ∗
Tractor is a system for understanding English messages within the context of hard and soft information fusion for situation assessment. Tractor processes a message through text processors using standard natural language processing techniques, and represents the result in a formal knowledge representation language. The result is a hybrid syntactic-semantic knowledge base that is mostly syntactic...
متن کاملAutomatic Generation of Language-Independent Features for Cross-Lingual Classification
Many applications require categorization of text documents using predefined categories. The main approach to performing text categorization is learning from labeled examples. For many tasks, it may be difficult to find examples in one language but easy in others. The problem of learning from examples in one or more languages and classifying (categorizing) in another is called cross-lingual lear...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v35i14.17512